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Description 

The mammalian immunological syst m is unique in its^ broad ability to produce protein compounds, 
known as immunoglobulins, having extremely high specificity for a particular molecular stnjcture. That is» 

5 these proteins have a conformation which is specifically able to complement a particular structure, so that 
binding occurs with high affinity. In this manner, the mammalian immune system is able to respond to 
invasions of foreign molecules, particularly proteins in surface membranes of microorganisms, and to toxins, 
resulting in detoxification or destruction of the invader, without adverse effects on the host 

The primarily immunoglobulin of the defensive mechanism is gamma-globulin (IgG). This im- 

10 munoglobulin, which is a glycoprotein of about 150,000 daltons, has four chains, two heavy chains and two 
light chains. Each chain has a variable region and a constant region. The variable regions are concerned 
with the binding specificity of the immunoglobulin, while the constant regions have a number of other 
functions which do not directly relate to the antibody affinity. 

In many situations it would be desirable to have binding molecules which, though substantially smaller 

75 than the immunoglobulins, still provide the specificity and affinity which the immunoglobulins afford. Smaller 
molecules can provide for shorter residence times in a mammalian host. In addition, where the im- 
munoglobulin has to be bound to another molecule, it will be frequently desirable to minimize the size of the 
final product. Also there are many economies in being able to produce a smaller molecule which fulfills the 
function of a larger molecule. 

so There are situations where it is desirable to be able to have a large number of molecules compactly 
held together. By having smaller molecules, a greater number can be brought together into a smaller space. 
Furthennore, where such a binding molecule can be prepared by hybrid DNA technology, one has the 
opportunity to bind the binding portion of the molecule to a wide variety of other polypeptides, so that one 
can have the binding molecule covalently bonded at one or both ends to a polypeptide chain. 

25 Where immunoglobulins are used in in vivo diagnosis or therapy, antisera from an allogenic host or 
from a monoclonal antibody may be Immunogenic. Furthermore, when conjugates of other molecules to the 
antibody are employed, the resulting conjugate may become immunogenic and elicit host antibodies 
against the constant region of the immunoglobulin or against any other part of the molecule. 

It is therefore important that methods be developed which permit the preparation of homogeneous 

30 compositions that comprise such binding molecules and have high specificity for a particular antigen or 
ligand but avoid the shortcomings of complete immunoglobulins and also afford the many advantages of 
lower molecular weight. 

Discussions concerning variable regions of heavy and light chains of immunoglobulins may be found in 
Sharon and Givol. Biochem. (1976) 15:1591-1594; Rosenblatt and Haber, Biochem. (1978) 17:3877-3882; 
35 and Early and Hood, Genetic "Engineering (1981) 3:157-188. Synthesis of part of a mouse immunoglobulin 
light chain in a bacterial clone is described by Amster et aL. Nucleic Acids Res, (1980) 8:2055-2065. 
Various references cited throughout the specification concern particular methodologies and composifions. 

EP-A-0043980 is concerned with a process for the preparation of a polypeptide comprising the amino 
acid sequence of mature human leukocyte interferon. 
40 An article entitled "Multiple related immunoglobin variable-regions identified by cloning and sequence 
analysis". Proc. Natl. Acad. Sci. U.S.A., Vol. 75, pages 3881-3885 describes the cloning of a DNA sequence 
coding for the variable region of an immunoglobin molecule. 

Neither document discloses or suggests the specific binding compositions to which the present 
invention relates. 

45 According to the present invention there is provided a specific binding composition comprising two 
polypeptide chains of an immunoglobulin having binding specificity to a predetermined ligand, characterized 
in that said two polypeptide chains 

(1) have the amino acid sequence of at least a portion of the variable region of an immunoglobulin, but 
substantially lack the constant region. 
50 (2) associate to form a complex having a high affinity and specificity for said predetemnined ligand, and 
(3) are the light chain of from about 95 to 115 amino acids and the heavy chain of from about 110 to 125 
amino acids, wherein said heavy chain includes the D-region. 

The polypeptide chains can be obtained by cultivation of genetically engineered microorganisms. 
Employing hybrid DNA technology, cDNA is tailored to remove nucleotides extraneous to the variable 
55 regions of the light and heavy chains. The resulting tailored ds cDNA is inserted into an appropriate 
expression vector which is then introduced into a host for transcription and translation. The resulting 
tmncated light and heavy chains define at least a major portion of the variable regions and associate to form 
a complex capable of specifically binding with high affinity to an antigen or ligand at a haptenic side thereof. 
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The binding constant will g nerally be greater than 10^, more usually gr ater than 10^, and preferably 
greater than 10^. 

Generally the polypeptide chains of the variable regions of the light and heavy chains will be employed 
together for binding to the ligand. However, it may exceptionally be possible to use a single chain if this 
5 chain has sufficient binding affinity to the ligand in question. 

The two polypeptide chains which, individually or together, provide the compositions of this invention 
will form a receptor site, analogous to the binding site of an immunoglobulin. The composition will be 
referred to as an rFv with the individual chains referred to as L-rFv or K-rFv. The L- and H- designations will 
normally mean light and heavy respectively; sometimes the two chains may be the same and derived from 
10 eitiier the light or heavy chain sequences. The polypeptide chains of the rFv will generally have fewer than 
125 amino acids, more usually fewer than about 120 amino acids, while normally having more than 60 
amino acids, usually more than about 95 amino acids, more usually more tinan about 100 amino acids. The 
H-rFv will be from about 110 to 125 amino acids while the L-rFv will be from about 95 to 115 amino acids. 
The amino acid compositions will vary widely, depending upon the particular idiotype involved. Usually 
75 there will be at least two cysteines separated by from about 60 to 75 amino acids and joined by a disulfide 
link (forming cystine) to define a domain. The two chains will normally be substantial copies of idiotypes of 
the variable regions of tiie light and heavy chains of immunoglobulins, but In some situations it may be 
sufficient to have combinations of eitiier tiie light or the heavy variable region chains. 

It will often be desirable to have one or both of the rFv chains labeled, e.g. with a radiosotope , 
20 fluorescer. or toxin, or bound to an inert physiologically acceptable support, such as syntiietic organic 
polymers, polysaccharides, naturally occurring proteins, or other non-immunogenic substances. 

It may sometimes be desirable to provide for covalent crosslinking of the two chains, e.g. by providing 
, for cysteine residues at the carboxyl termini. The chains will normally be prepared free of the constant 
regions; the J region may be present in part or in whole, or absent. The D region will be included in the 
25 transcript of the K-rFv. 

Generally only a relatively small percent of the total amino acids will vary from idiotype to idiotype in 
the rFv. Therefore, there will be areas providing a relatively constant framework and areas that will vary, 
namely, the hypervariable regions. 

The C-terminus region of the rFv will have a greater variety of sequences than the N-terminus and. 
30 based on the present invention, can be furtiier modified to permit variation from the naturally occurring 
heavy and light chains. A synthetic oligonucleotide can be used to introduce mutations encoding different 
amino acids in a hypervariable region. 

The preparation of the rFv by means of hybrid DNA technology will first be described in general temns 
and then in greater detail. 

35 To provide a homogeneous rFv having high binding affinity, the mammalian immune system can be 
used as starting point. The messenger^"^ from a hybridoma cell or otfier monoclonal antibody-producing 
cell is isolated and used to prepare a cDNA transcript encoding the light and/or heavy chains of the 
immunoglobulin. Based on the flanking sequences upstream and downstream, at the start (maybe Including 
leader region) and finish of the DNA encoding the variable region, short DNA sequences (oligonucleotides) 

40 at least partially complementary to those sequences are employed for primer repair or in vitro mutagenesis 
to remove extraneous flanking regions and to introduce translational control signals. The in vitro 
mutagenesis employs an oligonucleotide, which heteroduplexes with one of the strands of the cDNA, in 
combination with Klenow fragment of DNA polymerase I. Primer repair requires a homoduplexing 
oligonucleotide in combination with the same enzyme. The process is carried out twice (conveniently once 

45 with the coding strand and once witii the non-coding strand) to provide ds cDNA coding for the variable 
region with translational regulatory signals at predetermined sites. This ds cDNA is inserted into an 
appropriate vector, e.g. plasmid, to provide a hybrid vector capable of self-replication and having the proper 
regulatory signals for replication, selection and expression. 

This hybrid vector is then introduced into an appropriate host to express the variable regions of the 

50 heavy or light polypeptide chains of tiie rFv and the polypeptides are isolated. The variable regions of the 
heavy and light polypeptide components of the rFv are then associated in an appropriate medium to form 
the rFv. 

Since the idiotypes vary, the sequence of steps of the present invention permits one to handle a wide 
variety of coding sequences for variable regions. Also, the ds cDNA and vector can be tailored to optimize 
55 the regulatory signals which are employed, particulariy the promoter. The ribosome binding site and 
variable-region initiation codon may be properly spaced to optimize expression , of the variable-region 
polypeptide. 

The hybrid vectors containing the variable region coding sequence in the proper orientation are used to 
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transform the appropriate host for expression. The resulting transformants are selected by virtue of the 
maricers present in the vector and then cloned and expanded. The polypeptide produced by the transfor- 
mants may be isolated by separation of the cells and ijgolation of the supernatant into which such 
polypeptides are secreted; or, if tiie polypeptides are not secreted, the transformant cells are isolated and 

5 lysed. and tiie polypeptide is recovered. Fractions containing enhanced amounts of the variable region 
polypeptide may be obtained by various conventional techniques, such as gel electrophoresis, fractional 
preciptation, affinity chromatography, high pressure liquid chromatography, or the like. In any event, the 
original lysate, or supernatant, or the concentrated fractions therefrom, may be screened for the presence of 
the variable-region polypeptides by immunoassay. 

10 A heavy or light chain that is secreted may be isolated as follows. Polyclonal ahtisera to monoclonal 
Immunoglobulin can be prepared by immunizing an appropriate vertebrate with the whole monoclonal 
antibody, so as to produce antiserum which recognizes the determinant sites of the heavy and light chains. 
Antibodies recognizing the whole immunoglobulin or only tiie light or heavy chain may be substantially 
separated and purified from otiier antibodies in the antiserum. By binding to and eluting from affinity 

75 columns containing whole immunoglobulin, or only tiie heavy or light chains, covalentiy linked to an 
appropriate support, tiie antibodies for the whole immunoglobulin, or for tiie heavy or light chain respec- 
tively, become bound to the column. The column is denatured, and the purified antibodies are removed and 
Uien conjugated to an appropriate support to provide an affinity column to purify the heavy or light chains of 
the rFv. 

20 Where tiie light or heavy chain is not secreted, tiie transformed microorganisms containing the 
appropriate ds cDNA for either light or heavy chains are grown in liquid cultures and cleared lysates are 
prepared, e.g. by treatment of tiie microorganisms with lysozyme. rupture of the cell membrane, cen- 
trifuging and collecting the clear liquids These lysates are then passed over an immunosorbent affinity 
column prepared as described above, employing the specific polyclonal antisera. The bound variable 

25 regions are eluted from the column with an appropriate denaturing solvent The eluates from each of the 
heavy and light chain isolations are pooled and tiien treated to renature tiie polypeptides to form L-rFv and 
H-rFv respectively. For renaturation, the pooled eluates may be dialyzed against appropriate aqueous 
buffered solutions. The mixture is then further purified by passing it over the appropriate iigand-affinity 
column and the bound molecules eluted with an appropriate denaturing solvent. The variable regions are 

30 then renatured as previously described to provide a solution of rFvs wiiich may be used for a variety of 
purposes. 

In accordance with the present invention, molecules are provided which are polypeptide duplexes of the 
variable region of light and heavy chains of immunoglobulins, retaining the specificity of the im- 
munoglobulins. By lacking the constant regions, the rFvs are less immunogenic and may, therefore, be 

35 prepared from sources xenogenic to a host to which they are to be administered, Furtiiermore, the rFvs are 
a homogeneous mixture, rather than a heterogeneous mixture. (The heterogeneous mixtures, which will 
contain chains of varying lengtiis, could be obtained by other techniques, such as enzyme and acid 
treatment.) The homogeneity of the compositions of the present invention allows for uniform modification 
and accurate determination of therapeutic levels. In addition, there is no contamination with chains from 

40 whole immunoglobulins, which, if inadequately digested, would retain immunogenic portions or uncover new 
immunogenic sites. Rnaliy. large amounts of the desired rFvs may be prepared in -high yield and high 
purity. 

The present invention provides furthermore appropriate transformed expression vectors or plasmids 
can7ing a ds DNA sequence coding for said rFvs; transformed hosts (such as bacteria, e.g. B coli, or 

45 yeasts) canying such expression vectors; methods for preparing such transformed expression vectors or 
plasmids; and methods for preparing said rFvs by cultivating such transformed hosts. 

The transformed expression vector or plasmid according to the invention carries a ds DNA sequence 
that codes for a variable region of a light or heavy chain of an immunoglobulin specific for a predetemnined 
ligand but lacks nucleotides coding for aminoacid residues superfluous to said variable region and is 

50 equipped for initiation and termination codons at the 5'- and 3'-termini respectively of said DNA sequence. 

The ligand may be for example an enzyme or a surface protein. The ds DNA sequence may code for 
example for 2 variable region of a chain having about 95 to 125 amino acids, in particular for a variable 
region of a light chain having 95 to 115 amino acids or for a variable region of a heavy chain having about 
1 10 to 125 amino acids, especially for at least the D region of the heavy chain. 

55 The invention further provides a method for preparing a transformed expression vector or plasmid which 
cames a ds DNA sequence that codes for a variable region of a light or heavy chain of an immunoglobulin 
specific for a predetermined ligand but lacks nucleotides coding for amino acid residues superfluous to said 
variable region and is equipped for initiation and termination codons at the 5'-and 3*-termini respectively of 
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said DNA sequence; 
said method comprising: 

preparing ds cDNA encoding at least one of said light or heavy chains from an mRNA coding for said chain; 
removing nucleotide sequences from said ds cDNA superfluous to said variable region and providing for 
5 initiation and tenmination codons at the 5'- and 3'-temiini respectively of the DNA sequence to provide 
tailored ds cDNA encoding said variable region; 

and inserting said tailored ds cDNA into an expression vector for expression of said ds cDNA. 

The initiation and termination codons can be provided by in vitro mutagenesis. If desired, the method 
may include the additional step, prior to said inserting, of replacing at least one nucleotide in said ds cDNA 
70 to change a codon to encode for a different amino acid, 

A particularly prefenred embodiment of this method comprises the following steps a) to f): 

a) preparing ds cDNA coding for a light or heavy chain of an Immunoglobulin, each chain being 
composed of a constant region and a variable region, said variable regions having about 95 to 125 amino 
acids, by the steps of isolating mRNA that codes for said chain, reverse-transcribing said mRNA to 

75 produce ss cDNA, synthesizing a strand complementary to said ss cDNA by means of DNA polymerase 
to produce ds cDNA having a coding strand coding for said light or heavy chain, wherein said coding 
strands include DNA sequences that code for leader sequence, variable region and constant region of 
said immunoglobulin in the 5'-3* direction of said coding strand, and cloning said ds cDNA; 

b) providing a coding or non-coding ss cDNA strand from said cloned ds cDNA; 
20 and then carrying out step' c), d), e) and f) in the order decf or cedf: 

c) hybridizing to the non-coding strand at the juncture of the coding sequences for the leader region and 
variable region a first oligonucleotide primer having an initiation codon for defining the initiation site to 
produce a first duplex, enzymatically treating this duplex to elongate said first oligonucleotide primer In 
its 5'-3* direction complementary to said non-coding ss cDNA, and digesting said non-coding ss cDNA in 

25 the other direction up to the sequence complementary to said first oligonucleotide. primer, to produce N- 
termlnus defined ds cDNA; 

d) hybridizing to the coding strand at the DNA sequences coding for the juncture of the variable region 
and the constant region a second oligonucleotide primer that includes a stop anti-codon to produce a 
second duplex, enzymatically treating this duplex to elongate the second oligonucleotide primer in its 5'- 

30 3* direction complementary to said coding strand and digesting said coding ss cDNA in the other 
direction up to the sequence complementary to said second oligonucleotide primer, to produce O 
terminus tailored ds cDNA; e) cloning the resulting ds cDNA with its C- or N-terminus defined; separating 
the resulting ds cDNA with its C- or N-terminus defined into coding and non-coding strands; and using 
said coding strand if step d) follows but said non-coding strand if step c) follows; 

35 

and f) cloning the resulting N- and C-terminus tailored ds cDNA; and inserting said N- and C-terminus 
tailored ds cDNA into an expression vector or plasmid with said coding sequence in proper relationship 
with transcriptional and translational regulatory signals. 
A prefenred embodiment of this method comprises: 
40 A) preparing ds cDNA coding for a light or heavy chain of an immunoglobulin, each chain being 
composed of a constant region and a variable region, said variable regions having about 95 to 125 amino 
acids; 

by the steps of isolating mRNA that codes for said chain, reverse-transcribing said mRNA to produce ss 
cDNA. synthesizing a strand complementary to said ss cDNA by means of DNA polymerase to produce 

45 ds cDNA having a coding strand coding for said light Or heavy chain, wherein said coding strands 
include DNA sequences that code for leader sequence, variable region and constant region of said 
immunoglobulin in the 5'-3* direction of said coding strand, and cloning said ds cDNA; 
B) removing at least a portion of the DNA coding for the regions flanking said variable region of said light 
or heavy chain by separating the cloned ds cDNA into coding and non-coding strands; 

50 hybridizing to the non-coding strand a first oligonucleotide primer having an initiation codon for defining 
the initiation site for expression of a variable region, said first oligonucleotide primer being complemen- 
tary to the sequence coding for the N-terminus of the leader region or partially complementary to the 
DNA sequence coding for the juncture of the leader region and variable region, having a non- 
complementary initiation codon about at said juncture, to produce a first duplex, enzymatically treating 

56 the resulting duplex to elongate the first oligonucleotide primer in its 5'-3* direction complementary to 
said non-coding ss cDNA, and digesting said non-coding ss cDNA in the'^^^er direction up to the 
sequence complementary to said first oligonucleotide primer, to produce N-terminus defined ds cDNA; 
cloning the resulting N-terminus defined ds cDNA; 
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separating the resulting N-terminus defined ds cDNA into coding and non-coding strands: 
liybridizing to the coding strand a second ollgonulceotide primer that inciudes.a stop anti-codon but is 
otherwise complementary to the sequence at about the juncture of said variable region and said constant 
region to produce a second duplex, said stop anti-codon being at said juncture and thereby introducing a 
6 stop codon at the terminus of said variable region, enzymatically treating the resulting duplex to elongate 
said second oligonucleotide primer in its 5*-3' direction complementary to said coding strand and 
digesting said coding ss cDNA in the other direction up to the sequence complementary to said second 
oligonucleotide primer, to produce N- and C-terminus tailored ds cDNA coding for the variable region of 
the light or heavy chain free of the constant region of said immunoglobulin; 
70 cloning the resulting N- and C-terminus tailored ds cDNA; 

' and inserting said N- and C-temilnus tailored ds cDNA into an expression vector or plasmid with said 
coding sequence in proper relationship with transcriptional and translational regulatory signals. 
The first oligonucleotide primer may homoduplex with said non-coding strand at the N-tenminus of said 
leader sequence; or may hybridize at about tiie juncture between said leader sequence and said variable 
75 sequence to introduce an initiation codon at tiie N-terminus of tiie DNA sequence coding for said variable 
region. At least one oligonucleotide primer may be only partially complementary to said cDNA strand. 

The method may include tiie additional step, prior to said inserting, of ligating unique restriction linkers 
to said N- and C-tenminus tailored ds cDNA and enzymatically cleaving said linkers to provide cohesive 
tenmini. The cloning after each hybridizing step may include the additional step of selecting clones having 
20 said tirst or second oligonucleotide sequence, isolating the DNA containing said ds cDNA and recloning 
said ds cDNA. 

The principles and details of the present invention can be applied to the preparation of a transformed 
expression vector or plasmid which carries a ds DNA sequence that codes for only a desired part of a 
polypeptide chain of a protein or enzyme and is equipped for initiation and termination codons at the 5'- 
25 and 3*-termini respectively of said DNA sequence; 
by a method comprising: 

preparing ds cDNA from an m-RNA coding for said protein or enzyme; 

removing nucleotide sequences from said ds cDNA superfluous to said desired part of said polypeptide 
chain and providing for initiation and termination codons at tiie 5'-and 3*-terminl respectively of tine DNA 
30 sequence to provide tailored ds cDNA encoding said desired part of said polypeptide chain, 
and inserting said tailored ds cDNA into an expression vector for expression of said ds cDNA. 

The foregoing features of the present invention, especially the steps a) to f), can be adapted 
accordingly. 

An important feature of the present invention comprises a metiiod for preparing a binding polypeptide 
as which consists essentially of the amino acid sequence of at least a portion of tiie variable region of a light or 
heavy chain of an immunoglobulin specific for a predetenmined ligand. said amino acid sequence having 
substantially the binding specificity of the analogous chain, 
said method comprising: 

preparing ds cDNA encoding at least one of said light or heavy chains from an mRNA coding for said chain; 
40 removing nucleotide sequences from said ds cDNA superfluous to said variable region, and providing for 
initiation and termination codons at the 5'- and 3'-termini respectively of the DNA sequence to provide 
tailored ds cDNA encoding said variable region; 

inserting said tailored ds cDNA into an expression vector for expression of said ds cDNA and transforming a 
host for said expression vector with said expression vector containing said tailored ds cDNA; 
45 growing said transformed host, whereby said binding polypeptide of one of said light and heavy chains is 
expressed; and 

isolating said binding polypeptide. 

Anotiier important feature of tiie present invention comprises a method for preparing a binding 
polypeptide which consists essentially of the amino acid sequence of at least a portion of the variable 
50 region of a light or heavy chain of an immunoglobulin specific for a predetermined ligand, said amino acid 
sequence having substantially tiie binding specificity of the analogous chain, 
said method comprising: 

growing a host transformed by a transformed expression vector or plasmid which carries a ds DNA 
sequence that codes for a variable region of a light or heavy chain of said immunoglobulin but lacks 
55 nucleotides coding for amino acid residues superfluous to said variable region and is equipped for initiation 
and termination codons at tiie 5'-and 3*-termini respectively of said DNA sequence. 

The preparation of the rFv by hybrid DNA technology will nob be described in greater detail. 
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1 . Isolation of appropriat DNA S quences. 

In preparing the DNA sequences, a source of the genes-encoding the variable region will be required. 
The variable regions may be derived from IgA, IgD, IgE, IgG or IgM, most commonly from IgM and IgQ. 

5 Hiis can be achieved by immunizing an appropriate vertebrate, normally a domestic animal, and most 
conveniently a mouse. The immunization may be carried out conventionally with one or more repeated 
injections of the immunogen into the host mammal, nonmally at two to three week intervals. Usually three 
days after the last challenge, the spleen is removed and dissociated into single cells to be used for cell 
fusion to provide hybridomas. 

10 The immunogen will be the antigen of interest, or where a hapten is present, an antigenic conjugate of 
the hapten to an antigen. 

In order to prepare the hybridomas, the spleen cells are fused under conventional conditions employing 
a fusing agent. e.g. PEG6000. to a variety of inter- or Intraspecies myeloma cells, particularly mouse ceils 
such as SP-2/0. NS-1, etc. and then suspended In HAT selective media. The surviving cells are then grown 
75 in microtiter wells and immunologically assayed for production of antibodies to the determinant site(s) of 
interest 

Assays for antibodies are well known in the art and may employ a variety of lal^eled antigens or 
haptens, where the labels are conveniently radioisotopes, fluorescers, enzymes, or the like. Other tech- 
niques may also be employed, such as sandwich techniques involving two antibodies, one bound to a 

20 support and the other labeled. The cells from microtiter wells scored as positive are cloned either by 
limiting dilution or in soft agar. The resulting cloned cell lines are then propagated in an appropriate nutrient 
medium and. if necessary, may be stored frozen in liquid nitrogen. 

After selection of a particular cell line providing a monoclonal antibody of interest, the cells are 
expanded. Convenientiy, the cells may be grown to a density of about 1x10^ cells/ml in a 1 liter culture. The 

25 cells are then harvested by centrifugation and lysed. 

To obtain the desired DNA sequence, one can look to either the gene expressing the variable region or 
tiie messenger RNA, which expresses the variable region. The difficulty with employing genomic DNA is in 
juxtaposing the sequences coding for the variable region when these sequences are separated by introns. 
One must isolate tiie DNA fragment(s) containing the proper exons, excise the Introns and then splice the 

30 exons in the proper order and orientation. Generally this will be difficult, so that the alternative technique 
employing the messenger RNA will be tiie method of choice. 

Where the messenger RNA is to be employed, the cells will be lysed under RNase inhibiting conditions. 
Mature messenger RNA has the advantage tiiat it is free of introns, so ttiat the nucleotide sequence is 
continuous for the entire variable region. Difficulties with messenger RNA have been encountered, owing to 

35 incomplete reverse transcription, but these can be minimized. The tirst step is to isolate the messenger 
RNA. Conveniently, messenger RNA.which is polyadenylated, can be separated from other RNA on an 
oligo-(dT) cellulose column. The mixture of messenger RNA will be obtained free of otiier RNA. The 
presence of messenger RNAs coding for the heavy arid light chain polypeptides of the Immunoglobulins 
may then be assayed by hybridization with DNA single strands of tiie appropriate genes. Conveniently, the 

40 sequences coding for the constant portion of the light and heavy chains may be used as probes, which 
sequences may be obtained from available sources (see. for example. Early and Hood. Genetic Engineer- 
ing, Setlow and Hollaender eds. Vol. 3. Plenum Publishing Corp., New York (1981), pages 157-188 ). 

Whether the messenger RNA codes for the correct immunoglobulin may be determined by in vitro 
translation ehiploying a rabbit reticulocyte cell-free extract (Pelham and Jackson. Eurp. J. Biochem. (1976) 

45 66:247-256). The resulting translation product may then be isolated by employing antibodies specific for 
one or more of tfie regions of the chain of interest, for example, using rabbit anti(mouse IgG) where the 
chains are derived from mouse immunoglobulin. 

The immunoprecipitate may be further analyzed by polyacryl amide gel electrophoresis, and the 
presence of complexes determined by using radiotagged receptors for antigen-antibody complexes, such as 

50 S. aureus protein A. Rf factor, or the like. In addition. RNA blot hybridization (resolution of the mRNA 
Famples on agarose gel, transfer of the mRNA to nitrocellulose filter, baking at 80 'C. and testing with 
radioactive probes) can be employed to further ensure that the correct messenger RNA is present. 

The crude mixture of mRNA sequences containing the desired mRNA sequences will be treated as 
follows. In order to enhance the probability that full length cDNA is obtained, the method of Okayama and 

55 Berg, Mol. Cell. Biol. (1982) may be employed. Alternatively, the metiiods described by Efstradiadis and 
Villa-Komaroff (1979) in Genetic Engineering: Principles and Metiiods 'U Setlow and Hollaender, eds.. New 
York, Plenum . Press, pages 15-36, or Steinmetz et al. (1981) Cell 24:1 25-1 34, may be employed. The first 
strand of cDNA is prepared employing a virus reverse transcriptase in the presence of oligo-(dT)-primer. A 
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second strand may then be prepared employing reverse transcriptase, the Kfenow fragment of DNA 
polymerase I or T4 polymerase. If necessary, the resulting ds cDNA may then be treated with a single- 
strand-specific nuclease such as SI nuclease for removal of §ingle stranded portions to result In ds cDNA, 
which may then be cloned. 

6 

2. Preparation of Genes Coding For L-rFv and H-rFv and Introduction into an Expression Vector For 
Amplification. 

A wide variety of vectors may be employed for amplification or expression of the ds cDNA to produce 

10 the light and heavy chains of the immunoglobulin. A vector having an appropriate restriction site is digested 
with the appropriate endonuclease. The ds cDNA obtained from the reverse transcription of the mRNA may 
be modified by ligating linkers, treatment with terminal transferase or other techniques to provide staggered 
(complementary) or blunt ended termini. The vectors may have one. two or more markers for selection of 
transformants. Desirably, the vector will have a unique restriction site in one of multiple markers, so that the 

15 transformants may be selected by the expression of one marker and the absence of expression of the other 
marker. Various markers may be employed, such as biocide resistance, complementation of an auxotroph, 
viral immunity, or the like. 

An appropriate host, usually bacterial, 
e.g. E. coll. B. subtilis, S. cerevisiae. etc., is transformed according to conventional methods with the ds 

20 cDNA prepared from the mRNA, and the transformants are spread on agar plates and selected in 
accordance with the particular markers. The resulting colonies are screened, by restriction electrophoretic 
pattem, by hybridization to a labeled probe or by any other conventional means. See. for example. Hanahan 
and Meselson (1980) , Gene 10:63-67. One procedure employs colony hybridization, where the transfor- 
mants are grown on a solid medium to produce colonies., Cells from the colonies are transferred to a 

25 nitrocellulose replica filter, the transferred cells incubated for further growth, lysed, dried and baked e.g. at 
80 'C. The replica filter is then hybridized with appropriate radioisotope labeled probes. Probes for the 
determinant bonding sites present in the constant regions of a variety of mammalian immunoglobulins are 
readily available. The colonies may be probed according to the nature of the particular immunoglobulin or of 
the different determinant sites that may be present in the particular immunoglobulin. 

30 The host colonies that hybridise with the probes, i.e. that have DNA coding for either the light or the 
heavy chain, are picked and then grown in culture under selective pressure. In order to maintain selective 
pressure, it is desirable that the vector which is employed have biocidal. particularly antibiotic, resistance. 
After sufficient time for expansion of the host, the host cells are harvested, conveniently by centrifugation. 
The hybrid plasmid DNA may then be isolated by known procedures (e.g. those of Gunsalus et aL. J. 

35 Bacterid. (1 979) 1 40:1 06-1 33). 

Thelsolated plasmid DNA is then characterized by restriction enzyme digestion and DNA sequence 
analysis. These analyses ensure that the isolated cDNA clones completely encode the variable region and, 
optionally, the leader sequences for the light or heavy chain of the desired immunoglobulin. Furthermore, by 
having a restriction map of the variable regions, leader sequences and flanking sequences, one can 

40 determine the appropriate restriction sites for excising a DNA fragment which will allow for appropriate 
modification of the DNA sequence for insertion into a vector and expression of the polypeptide of interest. 
Where no unique restriction site is available at an appropriate position in the flanking regions, partial 
digestion may be employed, with selection of fragments having the variable region and. optionally, the 
leader sequence intact Where the 5' and 3* flanking regions are too extended, these can be shortened 

45 using Bal 31 to varying degrees by varying the period of digestion. 

Furthermore, by knowing the DNA sequence of the coding strand in the region of the C-terminus of the 
heavy and light chain variable regions, a stop codon may be introduced at the C-terminus by the following 
procedure of in vitro mutagenesis. The cDNA is restricted with the appropriate enzyme(s) to provide a 
segment coding foTthe variable region with additional 5' and 3* flanking sequences. This segment is 

50 purified, for example by gel electrophoresis, gradient density centrifugation, etc. The desired segment is 
Isolated and its two strands are dissociated, conveniently by boiling. Alternatively, the undesired strand of 
the intact cDNA-pIasmid clone may be nicked and digested. 

A synthetic, single-stranded oligonucleotide(DNA oligomer) is prepared, conveniently by synthesis, 
which will have a least about 12 nucleotides, more usually about 15 nucleotides, and will generally have 

65 fewer than about 50 nucleotides, usually fewer than 30 nucleotides, since a more extended oligomer is not 
required. 

Where this synthetic DNA oligomer is used in heteroduplexing, the non-complemenfary nucleotides will 
usually be flanked by at least about three, more usually at least about six nucleotides complementary to the 
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hybridized strand. The heteroduplexing DNA oligomer will be complementary to the sequence at or about a 
significant juncture i.e. between the leader sequence and the variable region or the variable region and the 
constant region. The DNA oligomer will b substantially complementary to the coding ("sense") strand of 
the variable-region sequence but will be altered to encode a termination codon at the C-terminus of the 

5 variable region. That is, the DNA oligomer will be complementary to the coding strand except at or about 
the amino acid which is involved at the juncture of the variable region and the J-or C regions of the light 
and heavy chains, particularly at or intermediate the D- or J- regions or intermediate the J-region, or at the 
J-region and C-region juncture. It is Intended that there will be some variation in the polypeptides which are 
prepared, so far as extending beyond the variable domains or not including all of the amino acids at the O 

10 terminus of the variable region. 

A excess amount of the DNA oligomer is combined with the denatured strands of the restriction 
fragment under sufficiently stringent hybridization conditions. Thus, tiie DNA oligomer specifically 
heteroduplexes to the complementary portions of the coding strand, while providing one or more stop 
codons to ensure the termination of expression at the desired amino acid at the C-termlnus. 

/5 After sufficient time for hybridization at the desired level of stringency, sufficient amounts of the four 
deoxynucleotides are added in conjunction with the Klenow fragment of DNA polymerase I. A strand 
complementary to the coding sequence of the variable-region and any SMIanking sequence is synthesized 
by enzymatic elongation of the primer resulting in a sequence complementary to the strand to which the 
DNA oligomer is bound. The single-stranded DNA sequence on the coding strand located 3* to the region 

20 hybridized to the synthetic oligonucleotide is degraded by the 3'-5' exonuclease activity of the DNA 
polymerase. In tiiis manner, ds cDNA is obtained which specifically codes for the variable-region and 
upstream flanking regions associated with the light and heavy chains. Each of the heavy and light chains is 
encoded to terminate expression at a predetermined codon in the V. D or J region. 

The resulting heteroduplexed blunt-ended ds cDNA fragments are tiien employed for preparation of 

25 othenwise homoduplexed ds cDNA coding for the light and heavy variable regions with the stop codons at 
the desired sites. Conveniently, the blunt ended fragments are either modified as described previously, e.g. 
joined to linkers which code for restriction sites which are absent in the variable region sequences, or tailed, 
e.g. with polyG or polyC tails.or used directly for insertion. Thus a fragment, after being joined to a 
restriction site linker, can be inserted into an appropriate vector having complementary termini, and then 

30 when desired can be recovered by restriction at the linker sites. The linkers are joined to the blunt-ended 
fragments with an appropriate ligase e.g. T4 ligase, and tiie resulting ligated fragment is restricted to 
provide a shorter fragment with cohensive ends, which is annealed to the complementary ends of a vector. 

This vector provides for amplification and convenient isolation of transformants having the variable 
region coding sequence insert. Numerous vectors for amplification in bacteria or other hosts exist such as 

35 pBR322, pSCIOI. pRK290, 2fi-plasmid. etc. Annealing the shorter fragment with the cohesive ends to the 
vector will provide a hybrid plasmid that has complementary DNA sequences except where the DNA 
oligomer introduced heteroduplexing; here the DNA sequences will be mismatched. This hybrid plasmid 
(containing mismatched sequences) will replicate in the host to generate two different plasmid molecules, 
one with the original sequence and one with the "tailored" or "site mutated" sequence derived from the 

40 syntfietic DNA oligomer. Therefore, each transformant colony is grown in small (approximately 2ml) culture, 
and the plasmid DNA is isolated in accordance with known procedures and used for a second cycle of 
transformation to provide individual clones replicating tiie "tailored" sequence. The clones may be screened 
by filter blot hybridization, or by probing with a labeled synthetic DNA oligonucleotide, e.g. the synthetic 
DNA oligomer employed in "tailoring" the variable region sequence, or by some other convenient 

45 technique. In this way piasmids are obtained having ds cDNA flanked by appropriate restriction sites and 
having a stop codon at a predetermined site. 

The 3*-terminus of the coding strand (defining Jhe C-terminus amino acid) has been defined, and the 5'- 
region of tiie coding strand (defining the N-terminus of the polypeptide) is next defined. Of course, the 
particular order in which tiie two termini are modified is primarily one of convenience; indeed, the two 

so termini can even be modified simultaneously, where primer repair is used at the 5'-end of the coding strand 
in conjunction with site mutation at the 3*-end. 

Different strategies may be developed, depending upon the nature of the host in which expression is to 
be obtained, and whether the host removes the leader sequence after recognising it as a secretory signal 
for secretion of the polypeptide. If the host cannot do this, then a DNA sequence coding for the leader 

65 sequence must be removed to provide a start codon at the 5'-terminus of the sequence of the coding strand 
coding for the variable region; then the shortened sequence' can 'l59^%T!50?!e^^r#» 3»^^ vectorln 
such a way as to be under the control of a predetermined promoter and ribosomal start site. 

Based on tiie sequence of the leader r gion or the sequence coding for the N-terminus of the variable 
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region, different oligonucleotides for homo- or heteroduplexing can be prepared. 

Where the leader sequence is retained, primer repair is employed to remove the 5*-flanking sequ nee 
of the coding strand. When the primer repair of the N-tenninus is performed simultaneously with the C- 
terminus mutagenesis, the coding strand, duplexed with the oligonucleotide, is treated with DNA poly- 
5 merase. and the resulting partial double-stranded DNA Is then treated with a 5'-3*-single strand exonuclease 
to remove the 5*-flanking region and with a ligase to provide for covalent linking of the replicated strand to 
the N-terminus oligonucleotide. 

Where the leader sequence is to be removed, in vitro mutagenesis is employed to introduce an 
initiating codon (ATG, for N-formyi-methionine [f-met])'at the N-terminus of the DNA sequence coding for 
70 the variable region. 

Altemative strategies may be employed for recovering the desired ds cDNA and performing the in vitro 
mutagenesis. If useful restriction sites are distant from the coding regions, the plasmid may be digested 
with the appropriate restriction endonuclease, followed by digestion with a double-strand exonuclease e.g. 
Bal 31. The resulting ds cDNA may be cloned and the proper sequence selected and modified, as 

75 appropriate, as described above. If the non-coding flanking region at the 5'-terminus of the coding strand is 
too long, it may be digested with an endonuclease where a convenient restriction bite is available, or by 
digestion with an exonuclease e.g. Bal 31. 

By repeating the above described procedure for modifying the 3*-terminus, except that the 
oligonucleotide is now complementary to the non-coding (nonsense) strand and includes an initiation codon 

20 at the 5'-end (primer repair) or within the oligonucleotide (in vitro mutagenesis), the S'-temninus of DNA 
sequence encoding tiie variable regions may be tailored. Normally, the oligonucleotide homoduplexes for 
primer repair and heteroduplexes for in vitro mutagenesis. In this way. "tailored" ds cDNA is obtained which 
has start and stop codons properly positioned to define the variable regions of both the light and heavy 
chains of immunoglobulins. The resulting blunt ended ds cDNA may be modified, e.g. by addition of linkers. 

25 to provide complementary termini for insertion into an expression vector in proper spacing to the regulatory 
signals which are ligated to the ds cDNA or are present in the vector. 

The ds cDNA is now ready for insertion into a vector for expression. As distinguished from tiie eariier 
vectors, which were solely concerned with replication of the ds cDNA. the vector which is employed at this 
stage requires the presence of the regulatory signals for transcription and translation. 

30 A vector is chosen having an appropriate promoter, as well as other transcriptional regulatory signal 
sequences, such as an operator, attenuator, or activator. Also, the vector will have been at least partially 
sequenced, so as to determine the presence of at least one insertion site for introduction of the ds cDNA 
coding for the variable regions at a site under the control of the regulatory signals. 

Besides transcriptional regulatory signals there are, as already indicated, translational regulatory signals. 

35 primarily tiie ribosomal binding site (Shine-Dalgarno sequence. "S-D") and tiie initiation codon ("f-met 
codon "y. The S-D sequence and the initiation codon must be in the proper spacing, generally spaced apart 
by from about 3 to 12 base pairs. The S-D sequence may be present on tiie vector in appropriate 
juxtaposition to an insertion site or may be joined to the variable region coding sequence, for example, by 
ligation of an oligonucleotide providing the S-D sequence and an appropriate restriction site upstream from 

40 the S-D sequence. Alternatively, the S-D sequence may be Introduced by in vifro mutagenesis, as 
previously described. The coding sequence must be in frame with the initiation codon. 

In choosing the different strategies, considerations include the presence or absence of particular 
restriction sites in the variable region coding sequence and flanking regions; the availability of vectors which 
allow for insertion of the ds cDNA sequence into the vector and expression of the variable region 

45 polypeptide; tiie availability of useful shuttle vectors: the availability of hosts which penmit expression and 
isolation in good yield; and the ability of the host to recognize such signals as secretory signals to cleave 
off the leader sequence. Therefore, in each situation with each different idiotype, it will be necessary to 
make a restriction map of at least portions of the DNA sequence coding for tiie variable region and the 
flanking regions. 

50 Where the termini of the vector and sequence to be inserted are tiie same, it will be necessary to check 
that the sequence has been inserted in the correct rather than the inconrect orientation. By mapping tfie 
resulting cloned plasmids after insertion, one can select those plasmids having the variable region sequence 
in the proper orientation. 

The above strategy allows tor a number of important advantages. The polypeptide chains are prepared 
55 as a homogeneous composition containing identical sequences and chain lengths. The polypeptides 
;fatb^iSl!&.ifii© fff .wiH1>e'tree-of sugars and. by virtue of their homogeneous and ungiycosylat d character, 
may be more unifomnly labeled or modified. In this way products are obtained of uniform and reproducible 
properties. Thus, the products may be reliably administered to a mammalian host with relatively slight 
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concern for unexpected responses that could be caus d by a h terogeneous spectrum of products. 
The following Example illustrates the present invention but in no way limits it: 

EXPERIMENTAL 

5 

The following Example description will be directed to the dinitrophenyl ligand as an example of a typical 
ligand. It is to be understood that the subject process will be useful for any ligand, although, owing to the 
wide variety of idiotypes involved, at various stages the process may have to be modified slightly to 
accommodate the presence of a particular restriction site or other unique feature. 

TO 

Example 

Preparation of Monoclonal Antitxjdies for Dinitrophenyl (DNP) 

75 Into an aqueous buffered medium at about pH 10.5 is introduced lOmmoles 2,4-dinitroben2ene 
sulfonate (c.f. Eisen et al., J.A.C.S. , 75 (1953), 4583) and O.OImmole of keyhole limpet hemocyanin and the 
mixture rocked for 20" hours at room temperature. The solution is then dialyzed against successive changes 
of 0.6M NaCI and the residue is isolated to be used for immunization. 

This DNP immunogen, 100 ;tg, is Combined as an emulsion with 0.1ml complete or incomplete 

20 Freund's adjuvant and 0.1ml PBS per dose. Each of 6 BALB/c mice is injected with four such doses at 
weekly intervals, each dose distributed intraperitoneally as well as subcutaneously into foot pads and into 
inguinal areas. The first injection is given with complete and the remaining three with incomplete Freund's 
adjuvant. Three days after the last injection, the mice are sacrificed and the spleens are isolated and used 
for formation of monoclonal antibodies. 

25 The fusion is performed by combining 3x10^ Sp2/0-Ag14 myeloma cells (Shulman et al. (1978) Nature 
276:269-270) and 5x10^ spleen cells and the mixture centrifuged at 200g for 5min and fesuspended slowly 
in 0.8ml 50% PEG 1500 in Dulbecco's modified Eagle's medium (Flow). After 1 min. at 37* C, 20ml of R 
medium (RPMI 1640 medium (Gibco) supplemented with 30m M Hepes (Row)) is added slowly. The cells 
are then centrifuged and resuspended in 20ml of R medium supplemented with 10% fetal caifs serum 

30 (Gibco) (RF medium) and 0.2ml of this suspension is then distributed to each of 200 wells containing 0.8ml 
RF medium. One hundred of these wells also contain 2x1 0^ mouse peritonea! exudate cells. After 24 hours* 
incubation, 1ml RF supplemented with HAT medium is added to each well. Every 2-3 days, 1ml or the 
medium is replaced with fresh RF+HAT. After two weeks, the cells demonstrating growth are tested for 
immunoglobulin production employing 35S-2,4-dinitrophenylsulfenamide of lysine. Clones showing specific 

35 activity are cloned by plating in soft agar to provide anti-DNP as required. 

Alternatively, one may use the method described by Herzenberg et al. (1980) J. Exp. Med . 151: 1071- 
1087. In this method, DNP substituted bovine serum albumin is addedio individual wells in a microtiter 
plate in an RIA diluent (1% BSA, 0.005M EDTA and 0.1% NaNa in PBS pH7.6) (50;il. 0.05mg/ml) and the 
mixture is incubated for 1 hour in the wells. Test or standard antisera at various dilutions are then added to 

40 coated wells (20/il/weIl) and incubated For 1h. After washing three times with the RIA diluent, ^^\-\abe\ed 
anti-mouse immunoglobulin (approximately 2x10^ cpm/well) is added and the mixture in incubated for 1h. 
Plates are then washed 3x with the RIA diluent, dried and evaluated by autoradiography. 

Both these methods of detecting the presence of the desired antibody are well known. The cells are 
then cloned either by limiting dilution or in soft agar and the resulting cloned cell lines are propagated and 

45 stored frozen in liquid nitrogen for use as required. 

Cells from one of the positive cloned cell lines are grown to a density of about 1x10^ cells/ml in a one 
liter culture. The cells are harvested by centrifugaji.on and 1 gram of the cells is dropped into 16ml of 
guanidlnium thiocyanate stock solution (4M, 50g of guanidinium thiocyanate with 0.5g of sodium N-lauryl 
sarcosine, 2.5ml of 1M sodium citrate. pHT.O,' 0.7m! of '2-mercaptoethanol and 0.5ml of Sigma 30% 

50 Antifoam A, and the volume brought to 100ml at room temperature) in & 55ml Potter-Elbehjem homogenizer 
tube and is immediately homogenized for 30-60 sees, at full speed with an 18mm diameter Tissumizer 
homogenizer (Tekmar Industries). The resulting homogenate is centrifuged for 10min. at S.OOOrpm in a 
Sorval HB4 swinging bucket rotor at 10* C. The supernatants are decanted into & flask, mixed with 0.024 
volume (relative to the original volume of homogenizing buffer) of 1 M acetic acid to lower the pH from 7 to 

55 5 and 0.75 volume of absolute ethanol. The flask is capped and thoroughly shaken and then stored at 

'"^S^Bife O^fffi^ material Is sedimented by centrifugation for 10min at -10* C at O.OOOrpm in an 

HB4 rotor. 

The resulting firm pellet is isolated and resuspended by vigorous shaking in 0.5 volume buffered 
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guanidine hydrochloride stock solution (7.5M. neutralized and then buffered with 0.25 volume of 1M sodium 
citrate. pH7.0. 5mM in dithiothreitol). The samples are briefly warmed in a 68* C water bath to ensure 
complete dispersion of the pellets, followed by precipitation by adding 0.025 volume (relative to the amount 
of guanidine hydrochloride) of 1M acetic acid in 0.5 volume ethanol. The solution is maintained for at least 

5 3h at -20*C and then centrifuged; the resulting pellet is resuspended in guanidine hydrochloride and 
reprecipitated as described above. The reprecipitated material is centrifuged for 5min at O.OOOrpm and 
thereafter all reactions are canried out under sterile conditions. 

The final pellets are dispersed in ethanol at room temperature, triturated to extract excess guanidine 
hydrochloride and then centrifuged for 5min at G.OOOrpm, The ethanol is evaporated with a stream or 

10 nitrogen and the RNA pellets dissolved with vigorous shaking in 1ml of sterile water per g. of original cells. 
After centrifugation for lOmin at 13.000rpm at 10* C. the supernatant containing the RNA is decanted. To 
ensure the complete extraction of all the RNA. the insoluble material is reextracted twice with 0.5ml of 
sterile water, the extract centrifuged for lOmin at I3.000rpm at 10* C and the aqueous solutions combined, 
mixed with 0.1 volume of 2M potassium acetate/acetic acid, pH5, and 2 volumes of ethanol and left 

75 overnight at -20* C. 

The RNA is sedimented from the ethanol suspension by centrifugation for 20min at 20.000rpm at -10 C 
in Corex tubes in an HB4 rotor. The resulting pellets are thoroughly washed with 95% ethanol, dried with 
nitrogen and dissolved in Iml/g cells of lOmM Tris buffer pH 7.5. 1mM EDTA. 0.2% SDS. After dissolution 
of the RNA pellet. 1/9 volume of 5M NaCI is added, and the solution applied to an oligo(dT)column (about 

20 0.5g dry weight. T3 grade, Collaborative Research). The column is washed extensively with 0.5M NaCI. 
lOmM Tris, 1mM EDTA. pH 7.5,0.2% SDS. and then eluted with lOmlVI Tris, ImM EDTA. pH 7.5. 0.05% 
SDS. The elution profile is monitored at X 2so. The UV absorbing fractions are pooled and precipitated by 
addition of acetic acid/sodium acetate buffer. pH 5, and 2.5 volumes of ethanol. The dried pellet is dissolved 
in 50fi\ (1 vol.) lOmM Tris. pH 7.5, ImM EDTA, and 9 vol. DMSO added, immediately followed by 1 vol. of 

25 buffered 1M UCL (1M UCI. 50mM EDTA, 2.0% SDS, lOmM Tris. pH 6.5). This solution is heated at 55* for 
5min, 100 vol. of binding buffer added, and then reapplied to the oligo(dT) cellulose column, equilibrated 
with binding buffer (0.5M NaCI. lOmM Tris, ImM EDTA. 0.2% SDS) and eluted as before. 

The presence of messenger RNA encoding the monoclonal immunoglobulin heavy and light chain 
polypeptides is verified by hybrid selection employing DNA clones of the appropriate heavy and light chain 

30 genes from sources described in Eariy and Hood. Genetic Engineering (1981) Vol. 3. Setlow and Hollander, 
Plenum Publishing Corp., pages 157-188, DNA probes can be prepared by synthesis, based on published 
amino acid sequences or published DNA sequences or obtained from a variety of sources reported in Early 
and Hood, supra. The DNA probes are denatured, neutralized and bound to nitrocellulose filter paper 
(Schleicher andSchuell BA-85-R 597) according to the method of Southern. J.Mol. Biol. (1975) 98:503-517, 

35 in lOx cone, standard citrate. (See also U.S.P. No. 4,302.204.) The probes are hybridized to 30 /iQ of tfie 
messenger RNA in 65% formamide/10mM Pipes buffer,pH6.4/0.4M NaCI in a final volume of 100 ;il at 50*C 
for 2h. The reaction mixture is spun for lOsec. in a Microfuge^ vortexed, spun again and then gently 
vortexed to resuspend the filters. The mixture is incubated at 50*C for about 1h with mild agitation. The 
reaction mixture is then removed and the filters are washed ten times in 1ml 0.1 5M NaCI/0.15M Na 

40 citrate/0.5% NaDodSG*. while maintaining the. wash buffer at 60 *C. After each addition of wash buffer, the 
tubes are vortexed for several seconds. The filters are then washed twice witii 1ml lOmM Tris, pH 7.8, 2mM 
EDTA. the tubes being incubated at 60* C for 5min and the solution then removed by aspiration. 

RNA is eluted from ttie RNA-DNA hybrid by boiling the filters for 60sec in 300 ft\ of double distilled, 
sterile water and then quick-freezing in a methanoi/dry ice bath. After thawing on ice, tiie water (containing 

45 the RNA) is transferred to a second tube and brought to a final concentration of 0.2M of sodium acetate, 
and 20 fig of calf thymus tRNA is added. The RNA is precipitated witfi 2.5 volume of ethanol at -20* C and 
immediately prior to translation tiie RNA is pelleted at 12.000g for lOmIn at 4*C, the pellet washed twice 
witii 70% ethanol and then dried under reduced pressure. 

The eluted mRNA is now translated in vitro with rabbit reticulocyte celt-free lysate, e.g. that of the 

50 commercially available translation kit frorrT New England Nuclear, whereafter protein syntiiesis may be 
detected according to instaictions with the kit. 

Aliquots are then incubated with monoclonal antibodiea in substantial excess to the amount of 
expression product in the in vitro translatiiaa lysate, Jtia.ajoibodxaoiisen complex is then, precipitated with 
fixed S. aureus and the pTecipitates are washed three times in 0.05M Tris. pH8.3. 0.45M NaCI in 0.5% 

55 NP40.~boiled in 0.01 M sodium phosphate buffer. pH7.5, containing 1% B-mercaptoethanol and tiien 
etectrophoresed on 5-20% gradient SDS-poiyacrylamide gels at 125V until tiie bromophenol blue marker 
runs off the end of the gel and for one further hour. The gels are then dried, fixed and autoradiographed on 
Kodak X-R film to establish the presence of messenger RNA coding for immunoglobulin light and heavy 
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chains. This messenger RNA mixture Is th n employed to prepare a library of double stranded cDNA by the 
method of Okayama and Berg. Molecular and Cellular Biology . Feb. 1982, 161-170. First, the preparation of 
V ctor primer and oligo dG-tailed linker DNA^ill be described: 

400 (iQ of pBR322-SV40 (map units 0.71-0.86) DNA are digested at 37* with 700 units of Kpnl 
6 endonuclease in a reaction mixture {0.4ml) containing 5mM Tris-HCI (pH 7.5). 6mM MgCb. 6mM NaCI. 
6mM 2-mercaptoethanol and O.lmg/ml bovine serum albumin (BSA). After 5hrs. the digestion is tenminated 
with 40 ti\ of 0.25M EDTA (pH 8.0) and 20 fil of 10% SDS; the mixture is extracted (to remove protein) with 
water-saturated phenol-CHCb (1:1) (hereafter refenred to as phenol-CHCb) and the DNA is precipitated with 
ethanol. 

10 Homopolymer tails averaging 60. but not more than about 80, dT residues per end are added to the 
kpnl endonuclease-generated termini with calf thymus tenminal deoxynucleotidyl transferase as follows: the 
reaction mixture (0.2ml) contains as buffer 140mM sodium cacodylate-30 mM Tris-HCI (pH 6.8), ImM 
CoCl2, 0.1 mM ditiiiotiireltol, 0.25mMdTTP. the Kpn l endonuclease-dlgested DNA and 400 units of the 
terminal deoxynucleotidyl transferase. After 30 minutes at 37* C the reaction is stopped with 20 p\ of 0.25M 

15 EDTA (pH 8.0) and 10 ^tl of 10% SDS and the DNA is recovered after several extractions witii phenol-CHCb 
by ethanol precipitation. The DNA is then digested with 17 units of Hpa l endonuclease in 0.2ml containing 
10mM Tris-HCI (pH 7.4), lOmM MgCb, 20mM KCI. ImM ditiiiotiireitol and O.lmg/ml BSA for 5hrs at 37* C. 

The large DNA fragment, which contains the origin of pBR322 DNA replication and the gene confening 
ampicillin resistance, is purified by agarose (1%) gel electrophoresis and is recovered from the gel by a 

20 modification of the glass powder method (Vogelstein and Gillespie, PNAS USA (1979) 76:615-619). 

The dT-tailed DNA is further purified by adsorption and elution from a oligo dA-cellulose column as 
follows: The DNA is dissolved in 1ml of lOmM Tris-HCI (pH 7.3) buffer containing 1mM EDTA and 1M NaCI. 
cooled to 0* and applied to an oligo dA-cellulose column (0.6 x 2.5 cm) equilibrated with tiie same buffer at 
0*. The column is washed with the same buffer at O' and eluted vyith water at room temperature. The 

25 eluted DNA (14 ng) is precipitated with etiianol and dissolved in 100 it\ of lOmM Tris-HCI (pH 7.3) witii ImM 
EDTA. 

The oligo dG-tailed linker DNA is prepared by digesting 100 ftg of pBR322-SV40(map units 0.19-0.32) 
with 120 units of PstI endonuclease in 0.2ml containing 6mM Tris-HCI (pH 7.4), 6mM MgCb. 6mM 2- 
mercaptoethanol, 50m M NaCI and O.lmg/ml BSA After 1.5hrs at 37* the reaction mixture is extracted with 

30 phenol-CHCIa and the DNA is precipitated with alcohol. Then, tails of 10-15 dG residues are added per end 
with 60 units of terminal deoxynucleotidyl transferase in the same reaction mixture (50 /*!) described above, 
except that 0.1 mM dGTP replaces dTTP. After 20 minutes at 37* C the mixture is extracted with phenol- 
CHCIa. and tiie DNA is precipitated with ethanol and digested with 50 units of Hindlll endonuclease in 50 /il 
containing 20mM Tris -HCI (pH 7.4), 7mM MgCb, 60mM NaCI and O.lmg/ml BSA at 37* for 1 hr. The small 

35 oligo dG-tailed linker DNA is purified by agarose (1.8%) electrophoresis and recovered as described above. 
The reaction mixture (10 contains 50mM Tris-HCI (pH 8.3), 8mM MgCb, 30mM KCI. 0.3mM 
ditiiiothreitol, 2mM each dATP, dTTP, dGTP. and ^ap-dCTP (850 cpm/pmol), 0.2 fig of tiie mRNA (about 2-3 
fold excess over primer ends), 1.4 ^ig of the vector-primer DNA (0.7 pmole primer end) and 5 units of 
reverse transcriptase. (The molar ratio of polyA mRNA to vector-primer DNA ranges from about 1 .5:1 to 

40 3:1.) 

cDNA syntiiesis is initiated by the addition of the reverse transcriptase and continued at 37* for 20min. 
By this time tfie rate of dCTP incorporation levels off and more than 60% of the primer is utilized for cDNA 
synthesis. The reaction is stopped with ^^t\ of 0.25M EDTA (pH 8.0) and 0.5 ti\ of 10% SDS; 10 /:l of phenol- 
CHCI3 is added and the solution vortexed vigorously and then centrifuged. 10 fil of 4M ammonium acetate 

45 and 40 ti\ of ethanol are added to the aqueous phase, and the solution is chilled with dry ice for 15min, 
warmed to room temperature with gentie shaking to dissolve unreacted deoxynucleoside triphosphates that 
precipitate during chilling, and centrifuged for lOmin in an Eppendorf microfuge. The resulting pellet is 
dissolved in 10 ti\ of lOmM Tris-HCI (pH 7.3) and ImM EDTA, mixed with 10 n\ of 4M ammonium acetate 
and reprecipitated with 40 fi\ of etiiano*,-and'!hen1f1nsed wiff' ethanol. 

50 The pellet containing the cDNA:mRNA plasmid is dissolved in 15 fi\ of 140mM sodium cacodylate- 
30mM Tris-HCI (pH 6.8) buffer containing 1mM C0CI2, 0.1 mM dithiotiireitol, 0.2 ^9 of poly A 66;iM ^^p- 
dCTP (6000 cpm/pmol) and 18 units of terminal deoxynucleotidyl transferase. The reaction is carried out at 
-'^7* for 5min to permit the addition of tO to tSj»sidue5^4>f dCMP per end and then terminated witii 1.5 ^1 of 
0.25M EDTA (pH 8.0) and 0.75 ^.1 of 10% SDS. The mixture is extracted with 15 fi\ of phenol-CHCb and 

55 mixed with 15 /il of 4M ammonium acetate, the DNA is precipitated and reprecipitated with 60 /il of ethanol 
and the tinal pellet rinsed witfi ethanoL 

This pellet is dissolved in 10 ;:l of buffer containing 20mM Tris-HCI (ph 7.4) 7mM MgCb, 60mM NaCI 
and 0.lmg/mi BSA and then digested with 2.5 units of Hind lll endonuclease for Ihr at 37* C. The reaction is 
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terminated with 1 fi\ of 0.25M EDTA (pH 8.0) and 0.5 ti\ of 10% SDS, the mixture is extracted with phenol- 
CHCI3. 10 /il of 4M ammonium acetate Is added and the DNA is precipitated with 40 pi of ethanoL Th 
r suiting p llet is rins d with ethanol and dissolv d in 10 fil ofJOmM Tris-HCI (pH 7.3) and ImM EDTA, and 
3 n\ of ethanol ar added to prevent freezing during storage at -20* C. 

1 (t\ of the HIndlll-endonuclease-digested oligo dOtailed cDNA:mRNA plasmid (0.02 pmol) Is incubated 
in a mixture (lOTiTcontaining lOmM Tris-HCI (pH 7.5). 1mM EDTA. 0.1 M NaCI and O.04 pmol of the oligo 
dQ-tailed linker DNA (this amount is a one-fold molar excess over the quantity of the double strand 
cDNA:mRNA and of the fragment which remains as a result of the Hind lll endonuclease digestion in the 
previous step) at 65* for 2min., followed by 42* for 30min, and then cooled at 0* . The mixture (10 ftl) is 
adjusted to a volume of 100 ^1 containing 20mM Tris-HCI (pH 7.5). 4mM MgCb. lOmM (NH4)2S04, 0.1 M 
KCI. 50 ftg/ml BSA and 0.1 mM /5-NAD; 0.6 ^rg of E. coli DNA ligase is added and the solution is incubated 
overnight at 12*. 

To replace the RNA strand of the double strand cDNA:mRNA. the ligation mixture is adjusted to contain 
40;£M of each of the four deoxynucleotide triphosphates, 0.1 5mM j3-NAD. 0.4fig of additional E. coli DNA 
ligase. 0.3/ig of E. coli DNA polymerase I. and 1 unit of E. coli RNase H. This mixture (104/tl) is incubated 
successively atT2'^and room temperature for Ihr each to promote optimal repair synthesis and nick 
translation by Poll. The reaction Is tenninated by the addition of 0.9ml of cold lOmM Tris-HCI (pH 7.3). and 
0.1ml aliquots are stored at O' C. 

Transfonnation is carried out using minor modifications of the procedure described by Cohen et al.. 
PNAS USA (1972) 69:2110-2114. E. coli K12 (strain HB101) is grown at 37*C in 20ml standard L-broth to 
an optical density of 0.5 at X 6oorThe"cell5 are collected by centrifugation. suspended in 10ml of lOmM 
Tris-HCI (pH 7.3) containing 50mM CaCb and centrifuged at 0*C for 5min. The cells are resuspended in 
2ml of the above buffer and incubated again at 0* C for 5min.; then. 0.2 ml of the cell suspensions is mixed 
with 0.1ml of the DNA solution and incubated at 0* C for 15min. After the cells are kept at 37* C for^2min. 
and at room temperature for 10min., 0.5ml of standard L-broth is added, the culture incubated at 37 C for 
30min.. and then plated on nitrocellulose filters on agar plates containing 50 fig/m! ampicillin. After 
incubation at 37* C for 12-24 hrs.. E. coli transformants are screened for the presence of the light and heavy 
chain cDNA according to the method of Grunstein and Hogness by in situ colony hybridization. Several 
thousand transfonnants are grown on three replica nitrocellulose filter discs, lysed with alkali and hybridized 
with the probes described previously for the constant regions of the heavy and light immunoglobulin chains. 
Clones of the genes coding for the heavy and light immunoglobulin chains are identified. Colonies that give 
positive hybridization signals are grown in one-liter of L-broth containing 50 ftg/ml of ampicillin. and their 
plasmid DNAs are isolated by standard techniques (Gunsalus et aL. J. Bact. (1979) 140:106-113). 

The cells are lysed as described previously, the lysate cleared by centrifugation and the cleared lysate 
diluted with an equal volume of water. RNase A is added to 50 figlml and. after 1h at 37*C, the lysate is 
extracted with 0.3 volume of phenol saturated with TE buffer (lOmM Tris-HCI. pH 7.9. plus ImM EDTA). 
After centrifugation (16.000g. 4* C. lOmin), the aqueous phase is removed and adjusted to 1M NaCI and the 
DNA precipitated with 2 volumes of ethanol. After several hours at -20 *C. the DNA is pelleted by 
centrifugation (lO.OOOg. 4* C 20min), dried and dissolved in TE buffer. 

Each cDNA clone is then restriction mapped and sequence analysed by conventional techniques, so 
that a restriction map is obtained which allows for subsequent manipulation of the cDNA coding for the 
variable regions for cloning and expression. The methods of Maxam and Gilbert. Methods Enzymol. (1980) 
65:499-560 and Sanger et aL. J.Mol.Biol. (1980) 143:161-178 are used, respectively. Those cDNA clones for 
light chains and heavy cFains encoding the complete variable region end 4eader sequences are selected for 
subsequent manipulation. 

Illustrative of the subject method will be the isolation, sequencing and manipulation of the K-chain (light 
chain) of M0PC41 and tiie heavy chain of the myeloma SI 07. 

The following is the sequence of the K-chain of M0PC41. where the sequences encoding the leader, 
variable region and constant re§*afT:w.^pafiie?*:^y/gaps. with only the first sixteen amino acids of th© 
constant region indicated (Seidman et aL. "Nature" (1979) 280: 370-375): 
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Met Asp Met Arg Ala Pro Ala 
. . . TCA GGA CTC AGC ATG GAC "ATG AGG GCT CCT GCA 

Gin lie Phe Gly Phe Leu Leu Leu Leu Phe Gin Gly 
CAG ATT TTT GGC TTC TTG TTG CTC TTG TTT CAA GGT 

Thr Arg Cys Asp lie Gin Met Thr Gin Ser Pro 
ACC AGA TGT ... GAC ATC CAG ATG ACC CAG TCT CCA 

Ser Ser Leu Ser Ala Ser Leu Gly Glu Arg Val Ser 
TCC TCC TTA TCT GCC TCT CTG GGA GAA AGA GTC AGT 

Leu Thr Cys Arg Ala Ser Gin Asp He Gly Ser Ser 
CTC ACT TGT CGG CCA AGT CAG GAC ATT GGT AGT AGC 

Leu Asn Trp Leu Gin Gin Glu Pro Asp Gly Thr lie 
TTA AAC TGG CTT CAG CAG GAA CCA GAT GGA ACT ATT 

Lys Arg Leu He Tyr Ala Thr Ser Ser Leu Asp 3er 
AAA CGC CTG ATC TAC GCC ACA TCC AGT TTA GAT TCT 

Gly Val Pro Lys Arg Phe Set Gly Ser Arg Ser Gly 
GGT GTC CCC AAA AGC TTC AGT GGC AGT AGG TCT GGG 

Ser Asp ^r Ser Leu Thr He Ser Ser 
TCA GAT TAT TCT CTC ACC ATC AGC AGC 



Leu Glu Ser 
CTT GAG TCT 
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Glu Asp Phe Val Asp Tyr Tyr Cys Leu Gin Tyr Ala 

GAA GAT TTT GTA GAC TAT TAG JX3T CTA CAA TAT GCT 

Ser Ser Pro Trp Thr Phe Gly Gly Gly Thr Lys Leu 
ACT TCT CCG TGG ACG TTC GGT GGA GGC ACC AAG CTG 

Glu He Lys Arg Ala Asp Ala Ala Pro Thr Val 
GAA ATC AAA CGT ... GCT GAT GCT GCA CCA ACT GTA 

Ser He Phe Pro Pro Ser Ser Glu Gin 
TCC ATC TTC CCA CCA TCC AGT GAG CAG ... 

The following is the nucleotide sequence of the heavy chain variable region of myeloma S107. with the 
leader, variable region and constant region separated by gaps, and only the first nine amino acids of the 
constant region depicted (Early et al. (1980), Cell. 19:981-992): 
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Met Lys Leu Trp Leu Asn Trp Val Phe Leu Leu 
ATG AAG TTG TGG TTA AAC TGG GTT TTT CTT TTA 



Thr Leu 



ACA CTT 



Leu His Gly lie Gin Cys • . - Glu Val Lys Leu 
TTA CAT GGT ATC CAG TGT GAG GTG AAG CTG 



Val Glu 



GTG GAA 



Ser Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg 
TCT GGA GGA GGC TTG GTA CAG CCT GGG GGT TCT CTG AGA 

Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Asp Phe* 
CTC TCC TGT GCA ACT TCT GGG TTC ACC TTC AGT GAT TTC 

Tyr Met Glu Trp Val Arg Gin Pro Pro Gly Lys Arg Leu 
TAC ATG GAG TGG GTC CGC CAG CCT CCA GGG AAG AGA CTG 

Glu Trp lie Ala Ala Ser Arg Asn Lys Ala Asn Asp Tyr 
GAG TGG ATT GCT GCA AGT AGA AAC AAA GCT AAT GAT TAT 

Thr Thr Glu Tyr Ser Ala Ser Val Lys Gly Arg Phe lie 
ACA ACA GAG TAC AGT GCA TCT GTG AAG GGT CGG TTC ATC 

Val Ser Arg Asp Thr Ser Gin Ser He Leu Tyr Leu Gin 
GTC TCC AGA GAC ACT TCC CAA AGC ATC CTC TAC CTT CAG 

Met Asn Ala Leu Arg Ala Glu Asp Thr Ala He Tyr Tyr 
ATG AAT GCC CTG AGA GCT GAG GAC ACT GCC ATT TAT TAC 

Cys Ala Arg Asp Tyr Tyr Gly Ser Ser Tyr Trp Tyr Phe 
TGT GCA AGA GAT TAC TAC GGT AGT AGC TAC. TGG TAC TTC 

Asp Val Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ser 
GAT GTC TGG GGC GCA GGG ACC ACG GTC ACC GTC TCC TCA 

Ala Lys Thr Thr Pro Pro Thr Val Tyr 
. . . GCC AAA ACG ACA CCC CCA TCT GTC TAT . • . 

Based on the DNA sequencing and the restriction map, PstI sites are found at the -110 base pair of the 
coding strand and downstream from the tenmination siteTor the cDNA coding for the light chain, while 
convenient Hindi!! restriction sites are found upstream from tlie leader sequence and downstream from the_ 
termination site of the coding strand for the heavy chain. The leader sequences and coding sequences of 
the light and heavy chain variable regions ar free of sequences recognized by the indicated en- 
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donucleas s. 

The Isolated plasmid DNAs are digested with the respectiv endonuci ases in accordance with the 
instructions of the supplied and the resulting fragnnents pudfied by electrophoresis on 2% agarose gels 
(Seakem), 15cm x 15cm x 0.2cm. at 100V for 2h. By employing markers, the band of the appropriate 
5 molecular weight is located and excised. The gel slice Is placed directly into a 1.5ml Eppendorf tube, 
rapidly frozen and thawed twice in a Dry Ice-alcohol bath and then centrifuged 5min in the Eppendorf 
centrifuge (15,000 rpm) and the supernatant recovered. The supernatant is boiled in 6xSSC to denature the 
DNA and provide single strands, followed by cooling to O' C. 

Based on the DNA sequence, a DNA oligomer is prepared which Is at least partially complementary to 
10 a short sequence of each of the non-coding ("anti-sense") strands of the variable region sequences of the 
light and heavy chains. The oligomer has an initiating codon (ATG, for N-formyl-methionine [f-met]) at Its 5'- 
end and is complementary to the downstream nucleotides at the N-terminus of the leader sequence for 
primer repair; or has an f-met codon inteanediate Its ends and complementary sequences to the 3'-end of 
the coding sequence for the leader region and the 5*-end of the coding sequence for the variable regions 
for in vitro mutagenesis. The oligomers are readily prepared in accordance with the methods described by 
(takura"eral.. J.Biol. Chem. (1 975) 1 50:4592. 

The following schemes depict the primer repair synthesis method for the light and heavy chains where 
the leader sequence is retained (a and b. respectively) and the in vifro mutagenesis method where the 
leader sequence is removed and an f-met codon introduced at the N-terminus of the coding sequence for 
20 the variable regions of the light and heavy chains (c and d. respectively). The extended lines represent long 
DNA chains, the short dashes the terminations Introduced in these schemes. 



75 



25 



30 



35 



40 



45 



5'- 



ATG GAC ATG AGG 6CT- 



3'- 



-JTAC CTG TAC TCC CGA- 



5' 



1) Denature (i-e. separate the 
strands ) 

2) 5' - ATG GAC ATG AGG GC - 5' 
ATG GAC ATG AGG GC 



•TAC CTG TAC TCC CG- 



5» - ATG GAC ATG AGG GC 
3' - TAC CTG TAC TCC CG 



Klenow fragment + dNTPs (dATP, 
dCTP, dGTP and dTTP) 

31 



■5' 



50 



56 
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•ATG AAG TTG TGG- 
-TAC TTC AAC ACC- 



1) Denature 

2) 5' - ATG AAG TTG TGG 
ATG AAG TTG TGG 

-TAC TTC AAC ACC 



-3' 
-5' 



-5' 



Klenow fragment + dNTPs 

ATG AAG TTC TGG 3* 

TAC TTC AAC ACC 5' 



•GGT ACC AGA TGT GAC ATC CAG- 
-CCA TGG TGT ACA CTG TAG GTC- 



1) Denature . 

2) 5' - GGT ACC AGA TGG ACA TCC - 



GG TAC CAG ATG GAC ATC C 



-CC ATG GTC TAC CTG TAG G 

\/ 
T 

Klenow fragment + dNTPs 

GG TAC CAG ATG GAC ATC C 3' 

CC ATG GTC TA(^^TG TAG G 5* 



1) PstI linker + T^i Polynucleotide 

ligase 

2) Pst I 

3) PBR522 Pst I digest /Tl* Poly- 

nucleotide ligase 

4) Purify by cloning 



GG TAC CAG ATG GAC ATC C- 
CC ATG GTC TAC CTG TAG G" 



-3' 
-5' 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



51. 
5'" 



-GGT ATC CAG TOT GAG GTG- 
-CCA TAG GTC ACA CTC CAC- 



■3* 
-5' 



1) Denature 

2) 5' - GGT ATC CAG ATG GAG GTG - 3' 

ATG^ 
/ \ 
GGT ATC CAG GAG GTG 



CCA TAG GTC 



\ 



CTC CAC- 



-5' 



5' 
3^ 

5' 
3.* 



GGT ATC CAG 
CCA TAG GTC 



aca/ 

Klenow fragment + dNTPs 
/ATG 



'^ACA'^ 



GAG GT6- 
CTC CAC- 



3' 
51 



Treat as in c, sub-steps 1)- -4). 

- GGT ATC CAG ATG GAG GTG 3' 

- CCA TAG GTC TAC CTC CAC 5' 



To 0.5 fig of the single stranded DNA is added 15pmole of 5'-phosphorylated oligonucleotide as 
described in a and b above in 38 ;il of 200 mM of NaCI, 13mM Tris-HCI, pH7.5. 9mM Mg acetate, 20mM ^- 
mercaptoethanol. the mixture boiled for 3min and immediately cooled to 0*C. To this Is added ^fl\ of 
solution which contains the four deoxynucleoside triphosphates at 4mM. 0.1 ftl of lOOmM adenosine 
triphosphate, and 1 ti\ (1 unit) of the Klenow fragment of DNA polymerase 1 (Boehringer Mannheim). 

In this manner, strands coding for the 5'-leader sequence and coding sequence or just the coding 
sequence for the variable region are synthesized and the single-stranded DNA sequences in the 3'-direction 
of the template non-coding strand are degraded by the 3'-5'-exonucIease activity. As a result, for strands 
containing the leader sequence, homodupiexes are obtained for coding the leader sequence and variable 
regions for both the light and heavy chains, which are blunt ended, having an initiation codon at the 5*-end 
of the coding strand with the remaining DNA sequence In frame with the initiation codon. 

To the resulting blunt ended duplex coding for the leader sequence and variable region of the chains, 
restriction enzyme linkers are ligated through the use of appropriate phosphorylated linkers, for example, 
PstI linkers, employing T4 polynucleotide ligase under conditions specified by the supplier. The vector 
pBR322 Is cleaved with Pst! to provide cohesive ends for linking to tiie modified cDNA . 

Each cDNA is combined with the linear pBR322 having complementary termini. Equal molar amounts of 
the vector and cDNAs are combined in an annealing buffer essentially as described in Steinmetz et al. 
(1981) Cell. 24:125-134, and the annealed DNA used directiy for transformalton. 

OnelSl'of an overnight bacterial culture E coli strain HB101 (Boyer and Roulland-Dussiox (1969) J. 
Mol. Biol. 41:459-472) is grown to 2x1 0^ cells/ml in L broth, pelleted by centifugation (Sorval SS34 rotor, 
85;000rpm. 4* C, 5min) and washed in 0.5 volume cola lOmM CaCla. The cell pellet is resuspended in 0.5 
volume cold 30mM CaCh. After 20 min or ice, the cells are again pelleted and resuspended in 0.1 volume 
cold 30mM CaCla. Then 0.20ml of the suspension is added to 0.1ml 30mM CaCb containing tiie annealed 
plasmids and incubated on ice for I6min- Each transformation is then heated to 42' 0 for 75sec prior to the 
addition of 5ml L broth. 

Transformed cultures are incubated at 37* C for 2hr. The transformants are tiien grown in agar plates 
containing M-9 minimal medium and 10 /ig/ml tetracycline. Clones which grow on this medium are then 
transferred to agar plates having M-9 minimal medium and 40{iQ/n\\ of ampicitlin. Those cells which are 
sensitive to ampicillin and resistant to tetracycline are then screened for the presence of plasmids having 
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the desir d cDNA. 

The selected clon s are then grown in 2ml of nutrient culture for 18h. A 0.5ml aliquot is transferr d to a 
1.5ml Eppendorf tube for plasmid extraction. Manipulations are carried out at room temperature unless 
othenftrise indicated. The tube is centrifuged for 15sec., the supernatant carefully removed with a fine-tip 

5 aspirator, and the eel! pellet is thoroughly suspended in 100/il of a lysozyme solution containing 2mg/ml 
lysozyme.50mM glucose, lOmM EDTA. 25mM Tris-HCI (pHB.O). 

After a 30min incubation at 0*C, 200;il of alkaline SDS solution (0,2N.NaOH, 1% sodium dodecylsul- 
fate) is added and the tube is gently vortexed. The tube is maintained for 5min at 0*C and then 160 ii\ of 
3M sodium acetate (pH4.8) is added. After gently mixing by inversion for a few seconds, a clot of DNA 

10 forms and the tube is maintained at 0* C for 16min. After centrifugation for 5min. 0.4 mi of the supernatant 
is removed, transferred to a second centrifuge tube, 1ml cold ethanol added and the tube held at -20* C for 
30min. The precipitate is collected by centrifugation for 2min and the supernatant removed by aspiration. 
The pellet is resuspended in 100 ft\ 0.1 M sodium acetate. 200ftl ethanol added, and after lOmin at -20* C, 
the precipitate is again collected by centrifugation, and the pellet is dissolved in 50/il water. 

75 Substantially the same procedure as described above is used for in vitro mutagenesis. With the primer 
repair synthesis, only one homoduplex is formed; with in vitro mutagenesis, a heteroduplex is initially 
formed which upon transformation and cloning results in two homoduplexes: the original gene sequence; 
and the modified or "tailored" gene sequence, which includes the change in sequence encoded in the 
oligomer. 

20 As depicted in c and d, oligomers are prepared which introduce an initiation codon ATG, coding for f- 
met,at the N-terminus of the coding sequence for the variable regions. 

The resulting plasmid DNA is isolated as described above and used again as described above for 
transformation. However, the resulting transformants are grown in small (2ml) culture for plasmid isolation. 
The plasmid DNA prepared from single transfomnant colonies arising from the second cycle of cloning is 

25 assayed by filter blot hybridization on nitrocellulose filters (Wallace et al. (1979) Nucleic Acids Research 
6:3542-3556) probing with ^zp-radio-labeled oligomers employed for the""mutagenesis so as to ensure the 
isolation of the desired tailored homoduplexes of the cDNA. The clones having the tailored sequence are 
isolated and the plasmid DNA extracted for further processing at the 3*-end of the coding strand. 

The cDNA coding for the variable regions can be excised by digestion with Pstl. Repeating the 

30 technique described in the previous In vitro mutagenesis, where an ATG ("start") codon is introduced 
before the codon of the N-terminal arnino acid of the mature polypeptide, "stop" codons are introduced at 
the C-terminus of the variable regions. Oligonucleotides are prepared as described previously having 
complementary sequences to the coding ("sense") strand of the variable-region cDNA. 

The oligonucleotides and the schemes for inserting the stop codon at the end of the variable regions 

35 are depicted as follows. The introduction of the stop codon in the light chain is set forth in e, while the 
introducfon of tiie stop codon in the heavy chain is set forth in f. In e and f, the stop codon (TGA) is 
underiined. 



40 



45 



50 



55 
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QAA ATC AAA CGT OCT GAT GCT TCA CC- 
CTT TAG TTT GCA CGA CTA CGA CGT GG- 



1) Denature 

2) 3' - TTT GCA ACT ACG ACG TGG 



GC 

AAA CGT*^ \ GAT GCT GCA CC- 



TTT GCA' 



-A CTA CGA CGT GG - 5 



Klenow fragment + dNTPs 
AAA CGT T GAT GCT GCA CC - 5" 



TTT GCA. 



-A CTA CGA CGT GG - 5 ' 



i 



Treat as in c. 
AAA CGT TGA TGC TGC ACC - 3' 
TTT GCA ACT ACG ACG TGG - 5' 
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f . 



10 



15 



5'- 

3'' 



ACC GTC TCC TCA GCC AAA ACQ ACA- 
-TGC CAG AQG AQT CGG TTT TQC TGT- 



•3' 
-5' 



1) Denature 

2) 3' - GAG GAG TAG TTT TGT CTG T - 5' 
/GCC^ 

-ACC GTC TCC TCA AAA AC6 ACA 3' 



3' - G AGG AGT TTT TGC TGT - 5' 
^ACT 



Klenow fragment and dNTPB 
GCC 

-ACC CTC TCC TCA^ AAA ACG ACA - 3' 



20 



2S 



1 



G AGG AGT TTT TGC TGT - 5' 

\ 



^ACT 

Treat as in c. 
ACC CTC TCC TCA TGA AAA ACG ACA - 3V 



-TGG GAG AGG AGT ACT TTT TGC TGT - 5' 



30 



35 



40 



45 



■ The effect of the Kienow fragment and the four desoxynucleoside triphosphates is to degrade the 3*-end 
of the coding strand up to and including the first nucleotide unpaired with the oligomer and to extend the 3*- 
end of the oligomer complementary to the ^'-end of the coding strand; consequently, all of the sequence 
coding for the constant region, except for the few nucleotides paired with the oligonucleotide, is removed, 
but double strand DNA is built up in the opposite direction. 

The heteroduplexes having the "tailored" sequences of the variable regions of the light and heavy 
chains are then ligated to PstI linkers, restricted with PstI endonuclease and inserted into the PstI site of 
pBR322. After cloning and ricloning. the plasmlds contaJning the tailored ds cDNA with the stop codons at 
the end of the variable regions are isolated and the sequences coding for the variable regions (which may 
also include the leader sequences) are excised from the pBR322 plasmid using the PstI restriction 
endonuclease and may now be used for expression of the polypeptide chains of the rFv. 

In order to obtain expression of the variable regions, the plasmid pGMI (pVH253AtrpLE1413; Miozarri 
and Yanofsky, J. of Bacteriol . (1976) 133:1457-1466) is employed. The plasmid is modified to Introduce a 
PstI site which "provides for insertion of ^he sequences coding for the variable regions with the f-met codon 
ATG in proper position to the Shine-Dalgarno sequence. The following oligonucleotide sequence is 
prepared: 



AGCTGCAGCTTTCGTT. 



50 pCMl (10 (tg) is nicked in one strand by digestion with EcoRI (Boehringer Mannheim. 1000 units) in 1ml of 
lOOmM Tris-HCi, pH 7.2, 50mM NaCl, 5mM Mg. acetite. 0.01 percent NP-40 and 150 /tg/ml ethidium 
bromide at 37* C for 1h. The reaction mixture is brought to lOmlVI EDTA and then extracted with 3 x 10 
volumes water-saturated isobutanol, 1 x phenol-CHCIs, 2 x ether and 1 x isobutanol to reduce the volume to 
0.1ml. After desalting by centrifugation through a 0.5ml Sephadex G-25 column, the DNA is recovered by 

55 precipitation with ethanol. Approximately 5 fig of the nicked DNA is Incubated with 40 units of exonuclease 
III (BRL) in 20 it\ of lOmM Tris-HCI, ph 7.5, 2mM MgCb and 1mM j3-mercaptoethanol for 90min at 37* c. 
The reaction is adjusted to 15mM Tris-HCI, pH 7.5, 7mM NaCI, 7mM MgCb. 7mM dithiothreitol. After 
adding 20 units of bacterial alkaline phosphatase (BRL) and 5 units of Hinfl (BRL), digestion is continued for 
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30min. at 37* C. The mixture is brought to lOmM EDTA. extracted 2 x ph not-CHCb. 1 x ether and desalt d 
by centrifugation through 0.5ml Sephadex G-25 equilibrated with water. 

A major portion of the resulting circular ssDNA is combined with SOpmoie of the 5'-phosphorylated 
oligonucleotide, depicted above for Introducing the PstI site, in 38;il of 200mM NaCI. 13mM Tris-HCI, pH 

5 7.5. 9mM magnesium acetate. 20mM ^-mercaptoethanol, boiled for 30min and immediately cooled to 0* C. 
After adding 5 fil of a solution 4mM in the four dNTPs. 0.5 il\ of lOOmM ATP. 3pl (3 units) of DNA 
polymerase I (Klenow fragment) and 4^1 (10 units) of T4 DNA ligase. the mixture is incubated ovemight at 
12' C and then used directly for transformation of E. coli HB101; the transformants are grown, isolated and 
analyzed using blot hybridization employing radiolabeled 32p-oiigomer to detect clones having the tailored 

10 sequence containing the new PstI site. 

' The "tailored" pGMI is isolated and partially restricted with PstI. and the DNA sequences coding for the 
light and heavy chain variable regions prepared above are inserted individually into the tailored site to 
provide two plasmids having DNA sequences coding for the light (pGMIL) and heavy (pGMIH) chains, in 
accordance with the procedure described previously for insertion. The resulting plasmids are used to 

IS transform E. coli HB 101. and clones having the light and heavy variable region sequences in the desired 
orientation'areTdentified by restiction mapping and purified. 

Antisera recognizing the light and heavy chains respectively are produced by using the particular chains 
as immunogens. and the antisera isolated and covalently linked to Sepharose by conventional procedures 
(March et al.. Anal. Biochem. (1974) 60:149-152) and the products employed for affinity columns. 

20 Thelrahsformants are grown to cell densities of about 10^ cells/ml and collected by centrifugation. The 
pellet Is resuspended in 60 tt\ of 50mM Tris-HCI, pH8. 50mM EDTA, 15% sucrose, Img/ml lyzosyme, 0.5% 
NP40. After 30min at 0* C, lO^il of 150mM Tris-HCI. pH 7.5. 280mlVI MgCla. 4mM CaCb and l^ig DNase are 
added, followed by centrifugation for 15min at 12.000g. 

The protein is then isolated by removal of the supernatant from the pellet and the supematants are 

25 passed over the immunosorbent columns (0.15ml) equilibrated with Tris-HCI. pH 7.5. The light and heavy 
chains of the rFv are eluted with 1M acetic acid, pH 2.5 and the eluates pooled and neutralized with 0.1 M 
NaOH at 0* C to pH 5.5. The pooled eluates are dialyzed against 3 x 100 volumes of sodium acetate buffer, 
pH 5.5, followed by 3 x 100 volumes PBS, pH 7. 

If the renatured light and heavy chains of the rFv have the same source, they can be further purified by 

30 combining the eluates containing the rFv components and passing them over a DNP-affinity column. (In the 
present example, different sources of heavy and light chains are described, so that this seep is omitted.) A 
DNP-affinity column and procedure are described by Koolstra and Richards. Biochem . (1978) 17:345-351. 
The bound rFv is isolated by elution with 1IVI acetic acid, followed by renaturing with sequential dialysis as 
described above. In addition, sulfhydryl groups may be capped with iodoacetamide as described by 

35 Kooistra and Richards, ibid. 

The present method provides protein complexes of homogeneous composition having two peptide 
chains which form a complex having high binding affinity for a predetermined haptenic site. The two chains 
form an rFv having specificity for a particular ligand, by mimicking a naturally occumng immunoglobulin. 
Without the constant regions, the resulting rFv has reduced immunogenicity and lacks peptide sequences 

40 which may have undesirable functions for particular applications e.g. complement fixation. 

The rFv can be used for a variety of pui-poses and diagnosis and therapy. The composition is 
homogeneous and therefore has a fixed reproducible level of immunogenicity. Also, owing to the reduced 
molecular weight, it will have relatively short residence times after injection into a mammalian host. This is 
particularly important where tiie rFv is labeled for diagnosis or therapy employing hazardous labels, such as 

45 radionuclides, heavy metals, cytotoxic agents, and the like. Short residence times can also be important 
where the rFv is used to inhibit physiologically active materials in vivo e.g. hormones, enzymes, surface 
receptors, lymphocytes or other cells, and the like. 

The uniform composition allows for controlled labeling, enhancing tiie ability of a conjugate to label a 
particular site on one or the other or both of the chains. The uniformity permits controlled conjugations. 

50 accurate determinations of therapeutic activity, easy monitoring of therapeutic effect, enhanced reproduc- 
ibility of results and control and ease of monitoring of side effects. 

The present invention provides for accurate syntiiesis of polypeptide chains which can be brought 
together to fonn a binding site for a predetermined epitopic site. The light and heavy chains prepared by 
the present metiiod can be brought together in the presence or absence of the ligand. Also, the invention 

66 permits introduction of a particular amino acid at either terminus of a chain for particular applications, e.g. 
tyrosine for radioiodination. By using monoclonal hybridomas as the source of the DNA for coding the 
variable regions, tiie naturally occurring binding efficiency is retained and binding affinity can be widely 
varied. • 
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Claims 

1, A specific binding composition comprising two polypeptide chains of an immunoglobulin having binding 
specificity to a predetemnined ligand, characterized in that said two polypeptide chains 

5 (1) have the amino acid sequence of at least a portion of the variable region of an Immunoglobulin, 

but substantially lack the constant region, 

(2) associate to form a complex having a high affinity and specificity for said predetermined llgand, 
and 

(3) are the light chain of from about 95 to 115 amino acids and the heavy chain of from about 110 to 
70 125 amino acids, wherein said heavy chain includes the D-reglon. 

2. A composition as claimed in claim 1 wherein each of said chains is labelled with a functionality capable 
of producing a detectable signal. 

75 3. A composition as claimed in claim 2, wherein said functionality is a cytotoxic agent or a radionuclide. 

Revendlcations 

1. Composition sp^cifique de liaison comprenant deux chatnes poiypeptidiques d*une Immunoglobuline 
20 ayant une specificity de liaison pour un ligand predetermine, caract^ris^e en ce que lesdites deux 
chatnes poiypeptidiques 

(1) ont la sequence d'acides amines d'au moins une portion de la region variable d'une immunoglo- 
buline, mais manquent sensiblement de la region constante. 

(2) s*associent pour former un complexe ayant une forte affinite et une forte specificite pour ledit 
25 ligand pr^d^tenmine, et 

(3) sont la chaTne \6ghre d'environ 95 Si 115 acides amines et la chaTne lourde d'environ 110 Si 125 
acides amines. oO ladite chaTne lourde comprend la region D. 

2- Composition selon la revendication 1 , oO chacune desdites chatnes est marquee d'une fonctionnalite 
30 capable de produire un signal detectable. 

3- Composition selon la revendication 2, oCi ladite fonctionnalite est un agent cytotoxique ou un radionucli- 
de. 

35 Anspriiche 

1. Spezifische Bindungszusammensetzung, umfassend zwei Polypeptid-Ketten eines Immunglobulins mit 
Bindungsspezifitat gegenOber einem vorbestimmten Liganden. dadurch gelcennzeichnet, dafi die zwei 
Polypeptid-Ketten 

40 (1) die Aminosauresequenz wenlgstens eines Teils der variablen Region eines Immunglobulins 

aufweisen. ihnen aber im wesentllchen die l<onstante Region fehlt. 

(2) assoziieren unter Bildung eines Komplexes, der hohe Affinitat und Spezifit^t gegenQber dem 
vorbestimmten Uganden aufweist, und 

(3) die leichte Kette von etwa 95 bis 115 Aminosauren und die schwere Kette von etwa 110 bis 125 
45 Aminosauren darstellen, wobei die schwere Kette die D-Region einschlieflt 

2, Zusammensetzung nach Anspruch 1, wobei eine jede Kette mit einer Funktionalitat versehen ist, die ein 
meflbares Signal erzeugen kann. 

50 3. Zusammensetzung nach Anspruch 1. wobei die Funktionalitat ein cytotoxisches Agens Oder ein 
Radionuklid ist. 
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